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Abstract. In this paper we study the problem of content placement in a cache 
network. We consider a network where routing of requests is based on random 
walks. Content placement is done using a novel mechanism referred to as “rein¬ 
forced counters ”. To each content we associate a counter, which is incremented 
every time the content is requested, and which is decremented at a fixed rate. 
We model and analyse this mechanism, tuning its parameters so as to achieve 
desired performance goals for a single cache or for a cache network. We also 
show that the optimal static content placement, without reinforced counters, is 
NP hard under different design goals. 


1. Introduction 

In today’s Internet the demand for multimedia files and the sizes of these files are steadily 
increasing. The popularity of Youtube, Dropbox and one-click file sharing systems, such 
as Megaupload, motivate researchers to seek for novel cost-effective content dissemina¬ 
tion solutions. 


Caching is one of the most classical solutions to increase the load supported by 
computer systems. In essence, caching consists of transparently storing data in a way 
that future requests can be served faster. Caching in the realm of standalone computer 
architectures received significant attention from the research community since the early 
sixties, and web-caching has also been studied for at least one decade. The objects of 
study of this work, in contrast, are cache networks, which were proposed and started to 
receive focus much more recently [Zhang and Carofiglio 20 12p Jacobson et al. 200911. 


Cache networks comprise two main features: caching and routing, both performed 
by the same core component: a cache router. In a cache network, content traverses the 
network from sources to destinations, but can in turn be stored in caches strategically pla¬ 
ced on top of the routers. The data plane is responsible for transmitting contents whereas 
the control plane is responsible for the routing of requests. The system considered in this 
paper is illustrated in Figure HJ A request for file F is routed using random walks through 
the caches. Every time the request hits a cache, the cache selects uniformly at random 
one of the links that are incident to the cache and uses it to forward the request. Once 
the request reaches a cache where the content is stored, the content is transferred to the 
requester through the data plane. We assume that all contents are stored in at least one 
cache. 


In this paper we study the problem of content placement in cache networks. We 
pose the following questions: (1) How to store and evict contents from a cache in a 
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Figura 1. Illustration of how system works (cache-router symbol introduced 
in [Rosensweig et al. 201 3| ). (a) An exogenous request for file F is placed and 
processed in cache 1; (b) the request is routed through the control plane, and 
yields an endogenous request to cache 2; (c) the request is processed in cache 
2; (d) the request is routed through the control plane to cache 4. The file is found 
and is transferred to the user through the data plane. 


flexible and scalable manner? (2) How to efficiently and distributedly place content in a 
cache network? 


Content placement is done using a novel mechanism referred to as “reinforced 
counters” [ Domingues et al. 2013[ Domingues et al. 2014|. To each content we associate 
a counter, which is incremented every time the content is requested, and which is decre¬ 
mented at a fixed rate. We model and analyze this mechanism, tuning its parameters so as 
to achieve desired performance goals for a single cache or for a cache network. 


Each cache in a cache network has a given service capacity. The service capacity 
of a cache is related to the time it takes to 1) find content in the cache, 2) return that 
content to the user, through the data plane, in case of a cache hit and 3) route the request 
to another cache, through the control plane, in case of a cache miss. Note that in the 
two latter cases, the service capacity of a cache network accounts for network delays due 
to transmission and queueing. A cache is stable if, for a given workload, the queue of 
pending requests does not grow unboundedly with respect to time. A cache network is 
stable if, for a given workload, all its caches are stable. We show that the optimal static 
content placement, without reinforced counters, is NP hard under different design goals 
accounting for stability and content availability. 


To summarize, in (partially) answering the questions above we make the following 
contributions 


• we introduce reinforced counters as a way to flexibly store and evict contents from 
a cache, showing that they are amenable to analytical study and optimal tuning 

• we propose a new formulation of the optimal content placement in cache networks 
accounting for stability. Then, we show that static content placement, without 
reinforced counters, is NP hard, which motivates the use of reinforced counters or 
variants in a network setting. 



































































The rest of this paper is organized as follows. Seetion ^studies the single eaehe 
poliee based on the reinforeed eounters. In ^we address the problem of multiple eaehes 
and we show that under different setting the problem at hand is NP hard. In 0we present 
related work, and ^eoneludes. 

2. Single Cache: Reinforced Counters and Flexible Content Placement 

In this section, we study placement and eviction policies for a single cache under the 
assumption that the dynamics of each content is decoupled from the others. Decoupled 
content dynamics yields tractable analysis and can be used for approximate systems with 
fixed capacity. They are also of interest in the context of DNS caches and the novel 
Amazon ElastiCache system (http : / /aws . amazon . com/elasticache/). 

In the policies to be introduced in this section, the expected number of items in 
the system can be controlled. Let be the probability that each content is stored in the 
cache. Given a collection of N contents the expected number of stored contents is Nn^p, 
which can be controlled or bounded according to users needs. In what follows, we con¬ 
sider the problem of controlling vr^p using a simple mechanism referred to as reinforced 
counters (Section [2TI) . In Section 1X2] we illustrate how the mechanism works through 
some simple numerical examples. Then, in Section [23] we extend the reinforced counters 
to allow for hysteresis, showing the benefits of hysteresis for higher predictability and 
reduced chances of content removal before full download. 


2.1. Reinforced counter with a single threshold 


We consider a special class of content placement mechanisms, henceforth referred to 
as reinforced counters [ Domingues et al. 2013[ Domingues et al. 2014|. To each content 
we associate a reinforced counter, which is increased by one every time the content is 
requested, and is decreased by one as a timer ticks. Let K be the reinforced counter 
eviction threshold. Whenever the counter is decremented from K +l{.o K the content is 
evicted. The timer ticks every l//i seconds. Henceforth, we assume that the time between 
ticks is exponentially distributed. This mechanism is similar to TTL caches, employed 
by DNS and web-caching systems [ jBerger et al. 2014]ILofack et al. 201211 with additional 
flexibility to allow for content to be inserted in the cache only after the threshold K is 
surpassed. The advantages of using a threshold K over existing mechanisms will be 
shown in this section. 

We assume that the behavior of each of the contents is decoupled. This assumption 
is of interest in at least three scenarios. 


• Cache capacity is infinite - In the cloud, we may assume that storage is infinite. 
Users might incur costs associated to larger amounts of stored content, but the 
storage space itself is unlimited. Therefore, costs are proportional to the space 
used, and constraints are soft (as opposed to hard). 

• Approximation of the behavior of fixed capacity system - Under the mean field ap¬ 
proximation PLofack et al. 20121 . it has been shown that decoupling the dynamics 
of multiple contents can lead to reasonable approximations to the content hit pro¬ 
babilities. In this case, instead of considering that the cache has a fixed capacity, 
it is assumed that there are constraints on the expected number of items in the 
cache. Note that under this approximation the cache capacity design problem can 
















be compared to the problem of calculating the capacity of communication lines in 
a telephone network. 

• Time-to-live caches - Roughly, when capacity is finite, one can take advantage 
of statistical multiplexing either by evicting items from the cache when (a) an 
overflow occurs or (b) when the item expire. In the latter case, if the probability 
of cache overflow is low, the approximate analysis of hit probabilities can be done 
assuming that each item is decoupled from the others UFofack et al. 20121 . This 
is particularly relevant in the context of time-to-live caches (such as those used 
by the DNS system) where entries need to be renewed from time to time to avoid 
staleness. 

Our goal in the remainder of this section is to show the advantages of having a 
content placement mechanism with two associated knobs, /r and K, allowing for the user 
to fine tune both the fraction of time in which the content is in the cache (steady state 
metric) while at the same time controlling the mean time between content insertions (or, 
equivalently, controlling the rate at which content is evicted or brought back into cache) 
so as to avoid that content is replaced too fast and content starvation (that is, content is 
never included into cache or removed from it during a finite but large time interval). The 
timer tick rate jj, and the reinforced counter threshold K can be tuned so as to adjust the 
long term fraction of time in which the content is stored in the cache and to guarantee that 
the mean time between content eviction and content reinsertion into the cache is bounded. 

In what follows, we assume that requests for a given content arrive according to a 
Poisson process with rate A. A is also referred to as the content popularity. (Recall that 
the behavior of each content is decoupled for others.) Figure |2] is useful to illustrate the 
different intervals of the cache content replacement and the notation we use. The blue 
(red) intervals in the figure indicate that the content is stored (or not) in cache. If content 
is not in cache it is brought into cache when a new request for it arrives and the reinforced 
counter is at the threshold K. On the other hand, if the value of the reinforced counter is 
dX K + 1 and the counter ticks, the content is removed from cache. Note that we adopt 
the same assumption as in UFofack et al. 20121 : the insertion and eviction of a content is 
not influenced by other contents in the same cache. As mentioned above, fixed storage in 
cache is modeled by considering the expected number of contents into the cache. 



timer ticks timer ticks 


Figura 2. Request counter and notation. 


Let TCup be the fraction of time in which the content is in the cache. Each cache 
content alternates from periods of inclusion and exclusion from cache (see Figure [2l), and 
we have (renewal theory). 


'^up 


E[B] 


E[R] + E[B] 


( 1 ) 


where E[R] is the mean time that a content takes to return to the cache once it is evicted 









and E[B] is the mean time that the eontent remains in the eaehe after insertion. It should 
be elear that the reinforeed eounter meehanism ean be modeled as an M/M/1 queueing 
system, with state equal to the value of the reinforeed eounter. Then, 

OO 

T^up = (1 - P)P' = P^^^ ( 2 ) 

i=K+l 


where p = \/p. 

Let 7 (if, p) be the rate at whieh eontent enters the eaehe. Due to flow balanee, in 
steady state, 7 (if, p) equals the rate at whieh eontent leaves the eaehe. 


7(A» = -p) = xppl -p)= j (3) 


where the last equality ean be easily inferred from Figure [2] (renewal arguments). 

E[B] ean be ealeulated from first passage time arguments, that is the time it takes 
from the system to return to state K (evietion) onee it is brought into system (state (if + 
1)). From the M/M/1 model, it ean also be ealeulated by the busy period of an M/G/1 
queue, whieh equals 

E[B] = l/{p-X). (4) 


Then, from dH) and O 


l/(/i-A) 
l/{p-X) + E[Ry 


Onee the eontent is evieted the mean time for it to return to the eaehe is given by 


(5) 


E[R] = (1 - TTup) / {TTupiP - A)) 


( 6 ) 


Given a fixed it up, it is possible to write p, p, E[R] and 7 as a funetion of if. 


p 

_ ^1/{K+1) 

up 

(7) 

p 


(8) 

E[R] 

= (1 - 7r„p)/(7r„p(A(7r„p)"^/(^+^) - A)) 

(9) 

7 

= A7r„p((7r„p)“^/(^+^^ - 1) 

(10) 


Note that there is a tradeoff in the ehoiee of if, as inereasing the value of if 
reduees the rate at whieh eontent is inserted into the eaehe (see equation ([3])), whieh in 
turn reduees the steady state eosts to download the eontent from external sourees. The 
larger the value of if, the smaller the steady state rate at whieh the eontent enters and 
leaves the eaehe. But inereasing the value of if, also inereases the mean time for the 
eontent to be reinserted into the eaehe onee the eontent is evieted. The larger the value 
of if, the longer the requesters for a given eontent will have to wait in order to be able to 
download the eontent from the eaehe after it is evieted (see equation ([91)). 




Let Kmax be the maximum value allowed for K. Motivated by the tradeoff above, 
given a fixed value of 7r„p we eonsider the following optimization problem, 


min 

K 

fi{K) = ay + I3E[R] 

(11) 

such that 


(12) 

K 

< K 

— max 

(13) 

'^up 

= 

(14) 

p 


(15) 


a and f5 are used to eontrol the relevance of long term and short term dynamics. The 
long term dynamics reflect the behavior of the system after a long period of time, during 
which the rate at which content enters the cache is given by (1 — p). The short term 
dynamics reflect the behavior of the system during a shorter period of time, during which 
one wants to guarantee that the mean time it takes for the content to return to the cache 
is not too large. We should keep in mind that we choose Hup to satisfy cache capacity 
limitations and system performance. 

Substituting (TTlj-lfT^ into (fTTI) . the objective function is given by. 


= a A7r„p((7r„p) - 1) + (5{l- vr^p) 




(vr.p)-V(i^+i) - 1 


(16) 


The value of E[R] must be bounded so as to avoid starvation, as formalized in the 
propositions below. 

Proposition 2.1. If (3 = 0, the optimal strategy consists of setting K = Kmax- ^f Kmax = 
oo, it will take infinite time for the content to be reinserted in the cache once it is evicted 
for the first time. 


Proof: The objective function is given by 

'fi{K) = aA7r„p((7r„p)"^/(^+^^ - 1) 

The derivative of the expression above with respect to K is 

d'ip{K) Alog( 

up)'^ Up 

dK ^"(iL+l)2(7r„p)V(^^+i) 

which is readily verified to be always negative. Therefore, the minimum is reached when 
K = Kmax- When K = oo ii follows from ([8]) that p tends to 0. The mean time for 
reinsertion of the content in the cache is given by (|9l) which grows unboundedly as p 
tends to 0. □ 

Proposition 2.2. If (3 > 0 the optimization problem (fTTI) - (fT5l) admits a unique mini¬ 
mum K*. 


(17) 


(18) 


The proof is omitted for conciseness. 











Figura 3. TVup = 0.9, a = /3 = 1 


2.2. Illustrative Example 

Next, we eonsider an illustrative example to show the tradeoff in the ehoiee of K. Let 
T^up = 0.9 and a = (3 = 1. Figure |3] shows how eost first deereases and then inereases, 
as K inereases. The optimal is reaehed for K = 10. At that point, we have E[R\ = 0.31 
and 7 = 0.32. 

Remark 1: Note that in problem (fTTI) - (fT5]) we do not set a hard eonstraint on 
E[R\. Alternatively, we eould add sueh a eonstraint, E[R\ < R*. The solution of the 
modified problem is either the solution of the problem without the eonstraint (in ease the 
eonstraint is inaetive) or the value of K whieh yields E[R] = R* (in ease the eonstraint is 
aetive). 

Remark 2: If follows from Markov inequality that the solution of the pro¬ 
blem (fTTI) -(fT5]) naturally yields a bound on the probability P{R > r), i.e., P{R > r) < 
E[R]/r. In the numerieal example above, when K is optimally set we have that the pro¬ 
bability that the the eontent is not reinserted into eaehe after 3.1 units of time following an 
evietion is P{R > 0.31 x 10) < 0.1. In the numerieal results we present latter P{R > r) 
is ealeulated exaetly from the model. 

Remark 3: We assume that K ean take real values. If K is not an integer, we ean 
always randomize between the two elosest integers when deeiding whether to store or not 
the eontent. 

2.3. Reinforced counter with hysteresis 

In this seetion we generalize the reinforeed eounter to allow for a third eontrol knob 
Kh as follows. The eounter is ineremented at eaeh arrival request for a eontent and is 
deeremented at rate l//i. In addition, the eontent is ineluded into eaehe when the value 
of the reinforeed eounter is ineremented to iF -f 1, as in previous seetion. However, 
eontent is not removed from eaehe when the eounter value is deeremented from K + 1 
to K. Instead, eontent remains in eaehe until the eounter reaehes the (new) threshold 
Kh- Figure IH illustrates the behavior of the new reinforeed eounter. From the figure, 
we observe that the purple intervals eorrespond to the values of the reinforeed eounter 
between K and and the eontent is in eaehe. The bottom of Figure |4] shows a trajeetory 
of the reinforeed eounter that motivates the name hysteresis. The displayed trajeetory is 
i —^ Kh —^ Kfi -|-1 — }■ K — y K -f 1 —^ j —^ Kh -f 1 —^ I ete. 

At first glanee, it seems that only the intervals where the eontent is in eaehe are 
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Figura 4. Reinforced counter with hysteresis. 

affected by this new mechanism (the blue intervals) but not the intervals where content 
is absent (the red intervals). However, this is not true and both intervals are affected. 
In what follows, we show how to calculate the expected values of E[B] and E[R\ and 
how the measures of interest are affected by this new mechanism. We also show the 
advantages of the reinforced counter with hysteresis. For that, we refer to Figure |5] that 
shows the Markov chain for the hysteresis counter. 



Figura 5. Reinforced counter with hysteresis: state transition diagram. 

Let v{i + 1) = E[B\ and + 1) = E[R\ both when K — = i- Note that v{l) 

and ,^(1) are the values of E[B] and E[R\ for a cache with no hysteresis. z/(l) is obtained 

from equations O, Q and dH). ,^(1) = j / (A/(A + /r)). 

Proposition 2.3. z/(i + 1) and .^(i + 1) can be obtained by the following recursions: 

uii) = - + uii — 1) + pail) i>2 (19) 

e(*) + 2<t<K-Kh + l (20) 

Proof: The proof follows from renewal arguments, and is omitted due to space limitati¬ 
ons. □ 

It is important to note that explicit expressions for v{i) and can be obtained 
as a function of A, p and K but details are omitted since the recursion above suffices to 
explain our comments. 

Suppose A and vr^p are given and we obtain K and p, for instance from the op¬ 
timization problem in the previous section. We allow Kh to vary from Kh = K (that is 
reinforced counter without hysteresis) to Kh = 0. 

Proposition 2.4. As Kh decreases, the rate ■y{K,p) at which content enters the cache 
also decreases. 


Proof: From Proposition 12.31 it is not difficult to see that both E[B] and E[R\ increase 
with Kh- Since 7 (iT, p) = 1/{E[B] + E[R]) (equation (|3])) the result follows. □ 


















Proposition 12.41 shows that, from an initial value of if we fix the parameters 
A and K, the rate at which content is replaced (both included and removed from cache) 
decreases by using the hysteresis mechanism, which is good to lower costs as explained in 
the previous section. However, also varies. As a consequence of Proposition 12.31 we 
can show that vr^p is reduced. This is not obvious since E[B\ increases. But, by adjusting 
the knob /i, 7 r„p can be maintained constant while 7 is reduced when the hysteresis schema 
is used. The proof of this last result is omitted but it follows from Proposition 12.31 The 
numerical results presented in the Section [2A] corroborate the claim. 

There are additional advantages of the hysteresis mechanism. First note that, in 
the previous sections, we assumed that file download times are negligible. However, when 
a user requests for a content it is important that the whole file remains stored at the cache 
not only until this user finishes downloading but also while other users are downloading 
the same content from that cache. Hysteresis is helpful to prevent the file from being 
removed before its download is concluded by all requesters. In Section [Z4l we show that 
hysteresis increases the probability that a content remains in cache for at least some time 
t after it is cached. Note that this is an additional (transient) performance measure and it 
differs from the 7 metric (steady state rate). As our numerical results show, by adjusting 
Kh we can improve both steady state and transient metrics. 

Our numerical results also show that hysteresis reduces the coefficient of variation 
of the time that content resides in the cache (B). This is important for cache capacity 
planning with multiple contents, since more predictable systems are usually easier to 
design and control. 

2.4. Numerical Results 

In this section we show some numerical results obtained for the models: the rein¬ 
forced counter with a single threshold (K) and the reinforced counter with two th¬ 
resholds (K,Kh). Three performance measures were used to analyze the models: the 
rate at which content enters the cache ( 7 ), the cumulative distribution of the time the con¬ 
tent takes to return to the cache (R) and the cumulative distribution of the time the content 
remains in the cache after insertion (B). 

Three scenarios were evaluated: (a) the reinforced counter has a single threshold 
K = 11, (b) the reinforced counter has two thresholds K = 11 and = 7, and (c) 
the reinforced counter has two thresholds K = 11 and Kh = 2. For each scenario, we 
consider the same value of vr^p. A, and K. 

The value of 7 for scenario (a) is 0.24, for (b) is 0.07 and for (c) is 0.04. We 
note that the rate at which content enters or leaves the cache decreases as the value of 
decreases. This is one of the advantages of introducing a third control knob K^. 

Figures[ 6 ta) and| 6 tb) show the cumulative distribution of R and B. We note that the 
reinforced counter with hysteresis allow to control the probability distribution of R and B. 
In Figurel^a), consider for example f = 4. If we set = 2, we have P[R < 4] = 0.35 
and if = 7, then P[R < 4] = 0.65. As the value of increases, the probability of 
R be less than a certain value of t increases. This behavior can also be observed for the 
distribution of B. On the other hand, if we consider the model with a single threshold we 
can not control the distribution of R and B. In Figure Oa), P[R < 4] = 0.9. 








P[R<t] 


Another advantage of the reinforced counter with hysteresis, is that the coefficient 
of variation of R and B, decreases with the value of Kh, which means that the dispersion 
of the distribution of R and B also decreases. The values obtained for the coefficient of 
variation of B for each scenario are: (a) 1.6, (b) 1.3 and (c) 1.1. 




t 


t 


(a) Cumulative Distribution of R 


(b) Cumulative Distribution of B 


Figura 6. Cumulative Distribution of the time the content takes to return/remains 
in the cache. 


3. Multiple Caches 

In the previous section we argued that we can decouple the cache storage dynamics of 
each content and study each content in isolation. Roughly, decoupling is a consequence 
of statistical multiplexing, for content placement policies that limit the amount of time a 
single content is cached, independently of the other requested contents. This is the case 
of the reinforced counter policies we study. We showed the advantages of these policies 
and the flexibility they bring to the design of caches. 

In this section we address the problem of multiple caches in a network. The main 
objectives are: (a) to formulate the problem of content placement at a cache network; (b) 
to show that if we do not use a decoupling policy such as those we study in this paper, the 
optimal content placement problem is NP hard. This last result emphasize the importance 
of employing a cache placement mechanism like the reinforced counters. 

3.1. Model and Problem Formulation 

Let and C be the set of files and caches in the system. There are F = \F\ files and 
C = \C\ caches in the system. File / has size tf, 1 < f < F and cache i has (storage) 
capacity Si, 1 < i < C. Users issue exogenous requests for files. At each cache i, 
1 < i < C, exogenous requests for file j arrive at rate Xij. We assume that cache i has 
service capacity rji requests/s. Once a request arrives at a cache, 

1. in case of a cache hit, the content is immediately transferred back to the requester 
through the data plane. Our model is easily adapted to account for the delay in 
searching for a file in the cache, but for the sake of presentation conciseness we 
assume zero searching cost in this paper; 

2. in case of a cache miss, the request is added to the queue of requests to be servi¬ 
ced. When the request reaches the head of the queue, it is transferred to one of the 
outgoing links, selected uniformly at random. 
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Figura 7. Illustration of cache network 


Let Aij be a variable that indieates if file j is available at eache i. ^From the results 
of previous seetion, Atj = 1 with probability 7r„p for eontent j at cache i. Therefore, we 
can compute P[Aij = 1] using the derived results. In addition, we may also study cases 
in which content is statically placed into caches, i.e., the content in the caches is not 
replaced. This corresponds, for instance, to a system where the replacements of files from 
the caches occur at a much coarser granularity than the requests. In this case, A^j = 1 if 
file j is at cache i and 0 otherwise. 


Let Mhi be the adjacency matrix between caches, i.e., M^i = 1 if there is a direct 
path from cache h to cache i and 0 otherwise. 


Let dh be the outgoing degree of node h. Let phi be the probability that a request 
from cache h is routed to cache i. In this paper, we consider random walk routing ( 
UDomingues et al. 2013HDomingues et al. 2014] ), i.e., pu = l/|dh| if Mhi > 0 and 0 
otherwise. 


Figure |7] illustrates the problem in the case where we have a cache network com¬ 
prised of 3 nodes, j, k and I, and 3 contents, indexed by 1, 2 and 3. In the Figure, the 
exogenous arrival rates ... of contents 1, 2 and 3 to cache j are made explicit. 

Let Trip {i) the designed parameter in previous sections, but for cache i and content 
(c). Let represents the total input rate of requests to content (c) arriving at cache j 
(exogenous arrivals to cache j plus those requests coming from other caches). A fraction 
(j) is immediately served, and fractions (1—vrip {j))pjk and a^'^\l—7Vup {j))pji 
are transferred to caches k and /, respectively. 

In the remainder of this paper, we will assume that requests for files that are cached 
at i, 1 < i < (7, are immediately processed at i, and we will be concerned with transfers 
that occur due to misses. 


af = Xf + 

k=l,k^j 


( 21 ) 


We can compute the mean response time from the expected total number of re¬ 
quests in the system. Using Little’s law, and recalling that the processing rate is pi we 
have that the average number of requests at cache i is E[Ni] = at/pi. Then, the expected 
total number of requests in the system is E[Ni] and the mean response 










time, E\T] = E[N]/K where A is the sum of the exogenous arrival rates to all eaehes. 

Definition 3.1: The eaehe network is stable if and only if rji > ai for alH, 1 < i < C. O 

Until this point, we foeused on the use of reinforeed eounters for eontent plaee- 
ment. In the remainder of the paper, we show that without reinforeed eounters a natural 
statement of optimal eontent plaeement yields an NP hard problem. 

Definition 3.2: The optimal content placement problem consists of finding the mapping 
A : E ^ C such that 

1 . the cache network is stable 

2. x;/=i < Si, for i = 1,... F 

3. the arrival rates of requests that require processing at the control plane, 
is minimized 


3.2. Static Content Placement in Cache Networks is NP Hard 

We show that the problem of static optimal content placement in cache networks is NP 
hard. This result is interesting because it shows the importance of using a cache repla¬ 
cement policy like those we study in this paper. Recall that, by using the reinforcement 
policy, we can optimize the counter parameters to satisfy a given probability of finding 
a content in cache for each content in isolation. These probabilities are in turn obtained 
to satisfy a given cache capacity constraint, from statistical multiplexing arguments, (the 
problem is identical to sizing a telephone network and one can use the results of that area.) 

First, we consider the case where different files have different sizes. In this case, 
it is possible to show that the problem is NP hard even if we need to make placement 
decisions at a single cache. To this aim, we can consider a mapping from Knapsack (the 
proof is omitted due to space constraints). Then, we consider the simpler case wherein all 
files have the same size. In this case, if we need to make placement decisions at a single 
cache, the problem can be solved using a greedy strategy. Nevertheless, we show that the 
problem is still NP hard if we need to make placement decisions in at least two caches. 
Theorem 3.1. The optimal content placement problem is NP hard. 

Proof: We refer to FeasibleC ACHE as the problem of deciding if a given cache network 
admits a feasible content placement. Note that the problem of finding the optimal place¬ 
ment, i.e. the PlacementProblem, must be harder than FeasibleCache, since sol¬ 
ving the optimization problem yields a solution to FeasibleCache. We proceed with 
a Turing reduction from the Partition problem to the FeasibleCache problem. The 
complete proof is omitted due to space limitations. □ 


4. Related Work 

The literature on caching UChe et al. 200911 and content placement UPresti et al. 200511 . 
and its relations to networking UNeves et al. 201411 . database systems and operating sys¬ 
tems iNelson et al. 19881 is vast. Nevertheless, the study of cache networks, which en¬ 
compass networks where routing and caching decisions are taken together at devices 
which work as routers and caches is scarce [Rosensweig et al. 2010[. In this work, we 













present a systematie analysis of eaehe networks using reinforeed eounters and indieating 
their applieability both in the single eaehe as well as multiple eaehe settings. 


Caehe networks play a key role in eontent eentrie networks 
(CCNs) HJaeobson et al. 20091 [Koponen et al. 2007[ |. CCNs are emerging as one of 
the potential arehiteetures for the future Internet. In CCNs, eontent rather than hosts 
is addressed. The glue that binds CCNs are the eontent ehunks rather than the IP 
paekets HJaeobson et al. 200911 . 


Using eaehes to reduee publishing eosts ean be helpful for publishers that eannot 
afford staying online all the time, or need to limit the bandwidth eonsumed to replieate 
eontent. With eaehes, eontent ean persist in the network even in the absenee of a pu¬ 
blisher [Koponen et al. 20071. In ease of intermittent publishers, eontent availability ean 
only be guaranteed in ease replieas of the eontent are plaeed in some of the eaehes. 


Approximate and exaet analysis of single eaehes and eaehe networks has been 
studied for deeades [ [Berger et al. 2014[lBianehi et al. 2013llFofaek et al. 20121 . Appro¬ 
ximate analysis is usually earried out using mean field approaehes UBianehi et al. 201^ 
or other asymptotie teehniques nielenkovic 199911 . whereas exaet analysis is performed 
aeeounting for Markov ehain properties [ [Fagin and Priee 1978 j or assuming uneoupled 
eontent dynamies UFofaek et al. 20121 . In this paper, we eonsider reinforeed eounters 
as the plaeement algorithms, whieh are at the same time amenable to analytieal study 
and of praetieal interest in the eontext of DNS systems or the novel Amazon ElastiCa- 
ehe [[Amazon 2014[[Berger et al. 2014| . Caehe polieies are traditionally devised to avoid 
staleness of eontent and/or improve system effieieney. The analysis presented in this paper 
shows that reinforeed eounters ean be used to target these two goals. 


5. Conclusion and Future Work 

In this paper we have studied the problem of sealable and effieient eontent plaeement in 
eaehe networks. We analyzed an opportunistie eaehing poliey using reinforeed eounters 
([ Domingues et al. 2013| ) as an effieient meehanism for eontent plaeement and proposed 
an extension to the basie seheme (reinforeed eounters with hysteresis). Using reinforeed 
eounters, we indieated how to tune simple knobs in order to reaeh the optimal plaee- 
ment. We then showed some of the properties of the meehanism and the optimal eontent 
plaeement problem. Finally, using the queueing-network model we propose for a eaehe 
network, we showed how to ealeulate the overall expeeted delay for a request to obtain 
a eontent. In addition, we showed that obtaining the optimal solution without reinforeed 
eounters is an NP hard problem. 

This work is a first step towards the eharaeterization of effieient, sealable and 
traetable eontent plaeement in eaehe networks. Future work eonsists of studying how to 
distributedly tune the eaehes, as well as allowing the presenee of eustodians that store 
permanent eopies of the files. 
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